A Statistical Framework to Identify Deviation from Time Linearity in Epigenetic Aging

Authors

  • Sagi Snir
  • Bridgett M. vonHoldt
  • Matteo Pellegrini
Abstract

In multiple studies, DNA methylation has proven to be an accurate biomarker of age. To develop these biomarkers, the methylation levels of multiple CpG sites are typically combined linearly to predict chronological age. By contrast, in this study we apply the Universal PaceMaker (UPM) model to investigate changes in DNA methylation during aging. The UPM was initially developed to study rate acceleration and deceleration in sequence evolution. Rather than identifying which linear combinations of sites predict age, the UPM models the rates of change of multiple CpG sites, as well as their starting methylation levels, and estimates the age of each individual to optimize the model fit. We refer to the estimated age as the "epigenetic age", in contrast to the known chronological age of each individual. We construct a statistical framework and devise an algorithm to determine whether a genomic pacemaker is in effect (i.e., rates of change vary with age). The decision is made by comparing two competing likelihood-based models, the molecular clock (MC) and the UPM. For the MC model, we use the known chronological age of each individual and fit the methylation rates at multiple sites; we express the problem as linear least squares and solve it in polynomial time. For the UPM case, the search space is larger, as we fit both the epigenetic age of each individual and the rates for each site, yet we succeed in reducing the problem to the space of individuals, so that it is polynomial in the more significant space, the methylated sites. We first tested our algorithm on simulated data to elucidate the factors affecting the identification of the pacemaker model. We find that, provided with enough data, our algorithm is capable of identifying a pacemaker even when only a weak signal is present in the data. Based on these results, we applied our method to DNA methylation data from human blood from individuals of various ages.
Although the improvement in variance across sites between the UPM and MC was small, the results suggest that the existence of a pacemaker is highly significant. The PaceMaker results also suggest a decay in the rate of change in DNA methylation with age.
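The contrast between the two models can be illustrated with a minimal Python sketch. It assumes each site's methylation level is modelled as a starting level plus a rate times an age, and that fit quality is measured by the residual sum of squares (RSS). The function names `fit_molecular_clock` and `fit_pacemaker`, the synthetic data, and the alternating least-squares update used for the UPM are illustrative assumptions; the alternating scheme is a simple stand-in and not the algorithm devised in the paper.

```python
import numpy as np

def fit_molecular_clock(meth, ages):
    """MC fit: meth[i, j] ~ start[i] + rate[i] * ages[j], ages held fixed.

    meth has shape (n_sites, n_individuals); each site is an independent
    linear least-squares problem, solved here in one call.
    """
    ages = np.asarray(ages, dtype=float)
    design = np.column_stack([np.ones_like(ages), ages])    # (n_individuals, 2)
    coef, *_ = np.linalg.lstsq(design, meth.T, rcond=None)  # (2, n_sites)
    start, rate = coef
    resid = meth - (start[:, None] + rate[:, None] * ages[None, :])
    return start, rate, float(np.sum(resid ** 2))

def fit_pacemaker(meth, ages, n_iter=50):
    """UPM-style fit (assumed alternating scheme, not the paper's algorithm).

    Alternates between refitting the site parameters and re-estimating each
    individual's age in closed form, starting from the chronological ages.
    """
    t = np.asarray(ages, dtype=float).copy()
    for _ in range(n_iter):
        start, rate, _ = fit_molecular_clock(meth, t)
        # Least-squares age of each individual given the site parameters.
        t = (rate @ (meth - start[:, None])) / (rate @ rate)
    start, rate, rss = fit_molecular_clock(meth, t)
    return t, start, rate, rss

# Hypothetical synthetic data: methylation changes with sqrt(age), so the
# rate of change decays with age and a purely linear clock fits less well.
rng = np.random.default_rng(0)
ages = np.linspace(1.0, 80.0, 40)
true_start = rng.uniform(0.2, 0.8, size=30)
true_rate = rng.normal(0.0, 0.02, size=30)
meth = true_start[:, None] + true_rate[:, None] * np.sqrt(ages)[None, :]
meth = meth + rng.normal(0.0, 0.005, size=meth.shape)

_, _, rss_mc = fit_molecular_clock(meth, ages)
epi_age, _, _, rss_upm = fit_pacemaker(meth, ages)
print(f"RSS under MC: {rss_mc:.4f}, RSS under UPM-style fit: {rss_upm:.4f}")
```

Because the UPM-style fit starts from the MC solution and each update can only lower the RSS, it never fits worse than the MC. It also has one extra free parameter per individual, so a lower RSS alone does not show that a pacemaker is in effect; that is why a formal statistical comparison of the two models is needed. Note too that the estimated ages in this sketch are identified only up to a shift and rescaling, since any such change can be absorbed into the site parameters.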

Similar resources

Biochemical and genetic theories of the aging process

Aging is the outcome of the progressive accumulation of different alterations in the body, which is accompanied by a gradual decrease in the efficiency of normal physiological functions and in the capacity to maintain homeostasis, leading to an increase in disease probability and death. Researchers have done various experiments, especially on animal models, for the perception of ...

A statistical analysis framework for bus reliability evaluation based on AVL data: A case study of Qazvin, Iran

Reliability is a fundamental factor in the operation of bus transportation systems because it is a direct indicator of the quality of service and the operator's costs. Today, the application of GPS technology in bus systems makes big data available, though it brings the difficulty of preprocessing the data methodically. In this study, the principal component an...

The Comparison of the Point-of-Care Serum Procalcitonin Assay Method with the BRAHMS Certified Method

Background and Aims: As a method for the diagnosis and management of sepsis, the serum procalcitonin assay is routinely used, especially in the emergency department (ED) and intensive care units (ICU). Procalcitonin has reasonable diagnostic accuracy for bacteremia in hospitalized patients of all age groups with suspected infection or sepsis. This study aimed to compare the Getein Biotech proca...

Geometric approach to string analysis for biosequence classification

Tools that effectively analyze and compare sequences are of great importance in various areas of applied computational research, especially in the framework of molecular biology. In the present paper, we introduce simple geometric criteria based on the notion of string linearity and use them to compare DNA sequences of various organisms, as well as to distinguish them from random sequences. Sev...

Identify direct and indirect nursing care time in a medical and surgical ward

Abstract Introduction: Classifying the average nursing care time is an independent measure for regulating the quantity and quality of the nursing staff in hospitals because it allows hospitals to evaluate the condition of the existing human resources. This study aimed to identify direct and indirect nursing care time in a medical and surgical ward. Methods: In this cross-sectional study ...

Journal title:

Volume 12  Issue 

Pages  -

Publication date 2016